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J5 ' Continually arriving information is communicated through a network of n agents, with 

the value of information to the j'th recipient being a decreasing function oi j/n, and commu- 
nication costs paid by recipient. Regardless of details of network and communication costs, 
the social optimum policy is to communicate arbitrarily slowly. But selfish agent behavior 
^ ' leads to Nash equilibria which (in the n — ?► oo limit) may be efficient (Nash payoff = social 

^O ! optimum payoff) or wasteful (0 < Nash payoff < social optimum payoff) or totally wasteful 

'^ ' (Nash payoff = 0). We study the cases of the complete network (constant communication 

costs between all agents), the grid with only nearest- neighbor communication, and the grid 
with communication cost a function of distance. The main technical tool is analysis of the 
associated first passage percolation process or SI epidemic (representing spread of one item 



>-«s ' of information) and in particular its "window width" , the time interval during which most 

agents learn the item. 

Many arguments arc just outlined, not intended as complete rigorous proofs. This version 
was written in July 2007 to accompany a talk at the ICTP workshop "Common Concepts in 
^ ' Statistical Physics and Computer Science" , and intended as a starting point for future thesis 

JH I projects which could explore these and many variant problems in detail. One of the topics 

herein (first passage percolation on the N x N torus with short and long range interactions; 
section 16. 2p has now been studied rigorously by Chatterjee and Durrett [1] , and so it seems 
appropriate to make this version publicly accessible. 
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1 Introduction 

A topic which one might loosely call "random percolation of information through networks" arises 
in many different contexts, from epidemic models [5] and computer virus models [lOj to gossip 
algorithms [8] designed to keep nodes of a decentralized network updated about information 
needed to maintain the network. This topic differs from communication networks in that we 
envisage information as having a definite source but no definite destination. 

In this paper we study an aspect where the vertices of the network are agents, and where there 
are costs and benefits associated with the different choices that agents may make in communi- 
cating information. In such "economic game theory" settings one anticipates a social optimum 
strategy that maximizes the total net payoff to all agents combined, and an (often different) Nash 
equilibrium characterized by the property that no one agent can benefit from deviating from the 
Nash equilibrium strategy followed by all other agents (so one anticipates that any reasonable 
process of agents adjusting strategies in a selfish way will lead to some Nash equilibrium). Of 
course a huge number of different models of costs, benefits and choices could fit the description 
above, but we focus on the specific setting where the value to you of receiving information de- 
pends on how few people know the information before you do. Two familiar real world examples 
are gossip in social networks and insider trading in financial markets. In the first, the gossiper 
gains perceived social status from transmitting information, and so is implicitly willing to pay for 
communicate to others; in the second the owner of knowledge recognizes its value and implictly 
expects to be paid for communication onwards. Our basic model makes the simpler assump- 
tion that the value to an agent attaches at the time information is received, and subsequently 
the agent takes no initiative to communicate it to others, but does so freely when requested, 
with the requester paying the cost of communication. In our model the benefits come from, and 
communication costs are paid to, the outside world: there are no payments between agents. 

Remark. Many arguments are just outlined, not intended as complete rigorous proofs. This 
version was written in July 2007, and intended as a starting point for future thesis projects which 
could explore these and many variant problems in detail. One of the topics herein (first passage 
percolation on the N x N torus with short and long range interactions; section [6^ has now been 
studied rigorously by Chatterjee and Durrett [4], and so it seems appropriate to make this version 
publicly accessible. 



1.1 The general framework: a rank-based reward game 

There are n agents (our results are in the n — )■ oo limit). The basic two rules are: 

Rule 1. New items of information arrive at times of a rate-1 Poisson process; each item comes 
to one random agent. 

Information spreads between agents by virtue of one agent calling another and learning all 
items that the other knows (details are case-specific, described later), with a (case-specific) com- 
munication cost paid by the receiver of information. 

Rule 2. The j'th person to learn an item of information gets reward R{-)- 

Here R{u), < n < 1 is a function such that 



R{u) is decreasing; R{1) = 0; < R := R{u)du < oo. (1) 



:= / R{u)di 
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Assuming information eventually reaches each agent, the total reward from each item will be 
^"^i R{n) ~ '^-^- If agents behave in some "exchangeable" way then the average net payoff (per 
agent per unit time) is 

payoff = R — (average communication cost per agent per unit time). (2) 

Now the average communication cost per unit time can be made arbitrarily small by simply 
communicating less often (because an agent learns all items that another agent knows, for the 
cost of one call. Note the calling agent does not know in advance whether the other agent has any 
new items of information). Thus the "social optimum" protocol is to communicate arbitrarily 
slowly, giving payoff arbitrarily close to R. But if agents behave selfishly then one agent may 
gain an advantage by paying to obtain information more quickly, and so we seek to study Nash 
equilibria for selfish agents. In particular there are three qualitative different possibilities. In the 
n — )■ oo limit, the Nash equilibrium may be 

• efficient (Nash payoff = social optimum payoff) 

• or wasteful (0 < Nash payoff < social optimum payoff) 

• or totally wasteful (Nash payoff = 0). 



1.2 Methodology 

Allowing agents' behaviors to be completely general makes the problems rather complicated (e.g. 
a subset of agents could seek to coordinate their actions) so in each specific model we restrict 
agent behavior to be of a specified form, making calls at random times with a rate parameter 9; 
the agent's "strategy" is just a choice of 0, and for this discussion we assume is a single real 
number. If all agents use the same parameter value 6 then the spread of one item of information 
through the network is as some model-dependent first passage percolation process (see section 
12. 2p . So there is some function Fg^ri{t) giving the proportion of agents who learn the item within 
time t after the arrival of the information into the network. Now suppose one agent ego uses 
a different parameter value (j) and gets some payoff-per-unit-time, denoted by payof[{(j),6). The 
Nash equilibrium value 0^'^''^ is the value of 9 for which ego cannot do better by choosing a 
different value of 6, and hence is the solution of 



-—payoff ((/),( 



0. (3) 



Obtaining a formula for payoff ((/), 0) requires knowing -Fe,n(^) and knowing something about the 
geometry of the sets of informed agents at time t ~ see ()19|26p for the two basic examples. The 
important point is that where we know the exact n — )• oo limit behavior of Fq n{t) we get a 
formula for the exact limit 9^^^^"^, and where we know order of magnitude behavior of Fg^n{t) we 
get order of magnitude behavior of 0^=^*. 

Note that we have assumed that in a Nash equilibrium each agent uses the same strategy. 
This is only a sensible assumption when the network cost structure has enough symmetry (is 
transitive - see section 17. ip and the non-transitive case is an interesting topic for future study. 



It turns out (section [3|) that for determining the qualitative behavior of the Nash equilibria, 
the important aspect is the size of the window width wg^n of the associated first passage perco- 
lation process, that is the time interval over which the proportion of agents knowing the item 
of information increases from (say) 10% to 90%. While this is well understood in the simplest 
examples of first passage percolation on finite sets, it has not been studied for very general models 
and our game-theoretic questions provide motivation for future such study. 



To interpret later formulas it turns out to be convenient to work with the derivative of R. 
Write R'{u) = —r{u), so that R{u) = J r{s)ds and ([1]) becomes 

r{u) > 0; < R:= ur{u)du < oo. (4) 

Jo 

1.3 Summary of results 

1.3.1 The complete graph case 

Network communication model: Each agent i may, at any time, call any other agent j (at 
cost 1), and learn all items that j knows. 

Poisson strategy. The allowed strategy for an agent i is to place calls, at the times of a Poisson 
(rate 6) process, to a random agent. 

Result (section [2]) . In the n — )■ oo limit the Nash equilibrium value of 9 is 

^Nash ^ /" (1 + iog(i _ u))R{u)du = [ r{u)g{u)du (5) 

JO Jo 

where g{u) = — (1 — n) log(l — n) > 0. 

Our assumptions ([T|) on R{u) imply < 0'^''* < R. Because an agent's average cost per unit 
time equals his value of 9, from ([2|) the Nash equilibrium payoff R — 9^'''''^ is strictly less than the 
social optimum payoff R but strictly greater than 0. So this is a "wasteful" case. 

1.3.2 The nearest neighbor grid 

Network communication model: Agents are at the vertices of the N x N torus (i.e. the grid 
with periodic boundary conditions). Each agent i may, at any time, call any of the 4 neighboring 
agents j (at cost 1), and learn all items that j knows. 

Poisson strategy. The allowed strategy for an agent i is to place calls, at the times of a Poisson 
(rate 9) process, to a random neighboring agent. 

Result (section [3]) . The Nash equilibrium value of 9 is such that as N ^ oo 

0N-h _ ^-1 f ^(^)^(^)(^^ (6) 



where g{u) > is a certain complicated function - see (|28p . 

So here the Nash equilibrium payoff R — 9^^"^ tends to R; this is an "efficient" case. 

1.3.3 Grid with communication costs increasing w^ith distance 

Netw^ork communication model. The agents are at the vertices of the N x N torus. Each 
agent i may, at any time, call any other agent j, at cost c{N,d{i,j)), and learn all items that j 
knows. 



Here d{i,j) is the distance between i and j. We treat two cases, with different choices of 
c{N,d). In section [5] we take cost function c{N,d) = c{d) satisfying 

c(l) = 1; c{d) "t oo as d ^ oo (7) 

and 

Poisson strategy. An agent's strategy is described by a sequence {9{d); d = 1, 2, 3, . . .); where 
for each d: 

at rate 9{d) the agent calls a random agent at distance d. 

In this case a simple abstract argument (section [S]) shows that the Nash equilibrium is efficient 
(without calculating what the equilibrium strategy or payoff actually is) for any c{d) satisfying 



In section [6] we take 



c{N,d) = 1; d=l 
= Cat; d> 1 



where 1 ^ ctv <C A^"^, and 



Poisson strategy. An agent's strategy is described by a pair of numbers (^noan ^far) = 9: 
at rate ^^ear the agent calls a random neighbor 
at rate 9{^^ the agent calls a random non-neighbor. 



In this case we show (I42p that the Nash equilibrium strategy satisfies 

" ' r ^~-^/^- flNash ^ A -2 



near 



for certain constants Ci)C2 depending on the reward function. So the Nash equilibrium cost 

— 1/2 

~ Cic^ , implying that the equilibrium is efficient. 



1.3.4 Plan of paper 

The two basic cases (complete graph, nearest-neighbor grid) can be analyzed directly using known 
results for first passage percolation on these structures; we do this analysis in sections [2] and [3j 
There are of course simple arguments for order-of-magnitude behavior in those cases, which we 
recall in section |4] (but which the reader may prefer to consult first) as a preliminary to the more 
complicated model "grid with communication costs increasing with distance" , for which one needs 
to understand orders of magnitude before embarking on calculations. 



1.4 Variant models and questions 

These results suggest many alternate questions and models, a few of which are addressed briefly 
in the sections indicated, the others providing suggestions for future research. 

• Are there cases where the Nash equilibrium is totally wasteful? (section 12. ip 

• Wouldn't it be better to place calls at regular time intervals? (section 17. 2p 

• Can one analyze more general strategies? 



• In the grid context of section ll.3.3| what is the equiUbrium strategy and cost for more 
general costs c{N,d)7 

• What about the symmetric model where, when i calls j, they exchange information? (sec- 
tion El]) 

• In formulas (j5|6p we see decoupling between the reward function r{u) and the function g{u) 
involving the rest of the model - is this a general phenomenon? 

• In the nearest-neighbor grid case, wouldn't it be better to cycle calls through the 4 neigh- 
bors? 

• What about non-transitive models, e.g. social networks where different agents have different 
numbers of friends, so that different agents have different strategies in the Nash equilibrium? 

• To model gossip, wouldn't it be better to make the reward to agent i depend on the number 
of other agents who learn the item from agent i? (section 17. 3p 



To model insider trading, wouldn't it be better to say that agent j is willing to pay some 
amount s{t) to agent i for information that i has possessed for time t, the function s(-) not 
specified in advance but a component of strategy and hence with a Nash equilibrium value? 



1.5 Conclusions 

As the list above suggests, we are only scratching the surface of a potentially large topic. In the 
usual setting of information communication networks, the goal is to communicate quickly, and 
our two basic examples (complete graph; nearest-neighbor grid) are the extremes of rapid and 
slow communication. It is therefore paradoxical that, in our rank-based reward game, the latter is 
efficient while the former is inefficient. One might jump to the conclusion that in general efficiency 
in the rank-based reward game was inversely related to network connectivity. But the examples 
of the grid with long-range interaction show the situation is not so simple, in that agents could 
choose to make long range calls and emulate a highly-connected network, but in equilibrium they 
do not do so very often. 



2 The complete graph 

The default assumptions in this section are 

Network communication model: Each agent i may, at any time, call any other agent j (at 
cost 1), and learn all items that j knows. 

Poisson strategy. The allowed strategy for an agent i is to place calls, at the times of a Poisson 
(rate 9) process, to a random agent. 



2.1 Finite number of rewards 

Before deriving the result ([5]) in our general framework, let us step outside that framework 
to derive a very easy variant result. Suppose that only the first two recipients of an item of 
information receive a reward, of amount Wn say. Agent strategy cannot affect the first recipient. 



only the second. Suppose ego uses rate <j) and other agents use rate 9. Then (by elementary 
properties of Exponential distributions) 

Piego is second to receive item) = — - (8) 

(j) + [n — 2)6 



and so 



We calculate 



payoff ((/), 9) = — + 



n (t)+{n-2)9 
Apayoff(^,.)= ^^-'^'^- 



((/>+ (n - 2)0)2 
and then the criterion Q gives 

(n - 2)Wn Wn 



gNash 



z^— ' 



(n — 1)2 n 

To compare this variant with the general framework, we want the total reward available from an 
item to equal n, to make the social optimum payoff — )■ 1, so we choose Wn = n/2. So we have 
shown that the Nash equilibrium payoff is 



payoff = l-0;j^*^i. (9) 



So this is a "wasteful" case. 



By the same argument we can study the case where (for fixed k > 2) the first k recipients get 

reward n/k. In this case we find 

k-1 



gNash 



k 
and the Nash equilibrium payoff is 

payoff ^ i (10) 

while the social optimum payoff = 1. Thus by taking A;„ — )• oo slowly we have a model in which 
the Nash equilibrium is "totally wasteful" . 

2.2 First passage percolation : general setup 

The classical setting for first passage percolation, surveyed in [TT], concerns nearest neighbor 
percolation on the d-dimensional lattice. Let us briefly state our general setup for first passage 
percolation (of "information") on a finite graph. There are "rate" parameters Vij > for undi- 
rected edges (i,j). There is an initial vertex vq, which receives the information at time 0. At 
time t, for each vertex i which has already received the information, and each neighbor j, there 
is chance i/ijdt that j learns the information from i before time t + dt. Equivalently, create inde- 
pendent Exponential(fij) random variables Vij on edges {i,j)- Then each vertex v receives the 
information at time 

Ty = min{Vi(,ii + Vhi2 + • • • + ^ife_ijfc} 

minimized over paths vq = iQ,ii,i2, ■ ■ ■ ,ik = v. 



2.3 First passage percolation on the complete graph 

Consider first passage percolation on the complete n- vertex graph with rates Vij = l/(n — 1). 
Pick k random agents and write S?^-, , . . . , <SJ^n for the times at which these k agents receive the 
information. The key fact for our purposes is that as n ^^ oo 

(5^1) - log n,...,^!;)- log n) 4 (^ + 5(1),...,^ + %)) (11) 

where the limit variables are independent, ^ has double exponential distribution P(^ < 2;) = 
exp(— e~^) and each S'/j) has the logistic distribution with distribution function 

^i(^) = T^-^^ -00 < X < 00. (12) 

1 + e^ 

Here — t- denotes convergence in distribution. To outline a derivation of (llip . fix a large integer 
L and decompose the percolation times as 

S^,) - log n = (tl - log L) + (5[^.) - tl + log(L/n)) (13) 

where tl is the time at which some L agents have received the information. By the Yule process 
approximation (see e.g. [T]) to the fixed-time behavior of the first passage percolation, the number 
N(t) of agents possessing the information at fixed large time t is approximately distributed as 
We^, where W has Exponential 1) distribution, and so 

P{tl <t) = P{N{t) >L)^ P{We^ >L) = exp(-Le"*) 

implying r^ — logL ~ ^ in distribution, explaining the first summand on the right side of (jlip . 
Now consider the proportion H{t) of agents possessing the information at time tl + t. This 
proportion follows closely the deterministic logistic equation H' = H{1 — H) whose solution 
is ()12p shifted to satisfy the initial condition H[Q) = L/n, so this solution approximates the 
distribution function of Su-^ — log(L/n). Thus the time SV^-s at which a random agent receives the 
information satisfies 

{S?^\ — Tl + log(L/n)) sa S'(j) in distribution 

independently as i varies. Now the limit decomposition (llip follow from the finite-n decomposition 
{[H.. 



We emphasize (llip instead of more elementary derivations (using methods of [H [13]) of the 
limit distribution for S^^n — log n because (jlip gives the correct dependence structure for different 
agents. Because only relative order of gaining information is relevant to us, we may recenter by 
subtracting ^ and suppose that the times at which different random agents gain information are 
independent with logistic distribution (I12p . 



2.4 Analysis of the rank-based reward game 

We now return to our general reward framework 

The j'th person to learn an item of information gets reward R{^) 

and give the argument for ([5]). 

Suppose all agents use the Poisson(0) strategy. In the case 9 = 1, the way that a single item 
of information spreads is exactly as the first passage percolation process above; and the general-^ 



case is just a time-scaling by 6. So as above, we may suppose that (all calculations in the n —^ oo 
limit) the recentered time S$ to reach a random agent has distribution function 

Fg{x) = Fi{ex) (14) 

which is the solution of the time-scaled logistic equation 

^=«f. (15) 

(Recall Fi is the logistic distribution (J12p ). Now consider the case where all other agents use a 
value 9 but ego uses a different value </>. The (limit, recentered) time T^^g at which ego learns 
the information now has distribution function Gs g satisfying an analog of (|15p : 



G' 

= (t>Fg. (16) 



l-G, 



To explain this equation, the left side is the rate at time t at which ego learns the information; 
this equals the rate (j) of calls by ego, times the probability Fg{t) that the called agent has received 
the information. To solve the equation, first we get 



1 - ^0,61 = exp (-</'/ Fg\ . 
But we know that in the case cj) = 9 the solution is Fg, that is we know 

l-Fg=e^v(-0 j Fg 
and so we have the solution of (I16p in the form 

l-G^,g = {l-Fgfl''. (17) 

If ego gets the information at time t then his percentile rank is Fg{t) and his reward is R{Fg{t)). 
So the expected reward to ego is 

ER{Fg{T^fi)); where <X\si{T^^g) = G^,g. 

We calculate 

P{Fg{T^^g)<u) = G^^g{F~\u)) 

= I - {I -Fg{F^\u))f I' hym 

= l-{l-ufl^ (18) 

and so 

ER{Fg{T^^g)) = [ r{u) (1 - (1 - ufl')du. 
Jo 

This is the mean reward to ego from one item, and hence also the mean reward per unit time in 

the ongoing process. So, including the "communication cost" of (j) per unit time, the net payoff 

(per unit time) to ego is 

payoff (0, 9) = -<j)+ I r{u) (1 - (1 - u)^/^)du. (19) 

The criterion ([3]) for to be a Nash equilibrium is, using the fact -^x'^' = -^^x**^' , 

1=1 / r{u) {-\og{l - u)) {I - u)du. (20) 

Jo 

This is the second equality in ([5]), and integrating by parts gives the first equality. 



Remark. For the linear reward function 

R{u) = 2{1 - u); R = l 
result ([5]) gives Nash payoff = 1/2. Consider alternatively 

Then the n — )> oo Nash equilibrium cost is 

0^^*(uo) = — / (1 + log(l - n)) du. 
uo Jo 

In particular, the Nash payoff 1 — 8^'^''^^{uq) satisfies 

1 - 6'''='='^ (no) ^ as no ^ 0. 

In words, as the reward becomes concentrated on a smaller and smaller proportion of the pop- 
ulation then the Nash equilibrium becomes more and more wasteful. In this sense result ([5]) in 
the general framework is consistent with the "finite number of rewards" result (jlOp . 

3 The N X N torus, nearest neighbor case 

Network communication model. There are A^^ agents at the vertices of the N x N torus. 
Each agent i may, at any time, call any of the 4 neighboring agents j (at cost 1), and learn all 
items that j knows. 

Poisson strategy. The allowed strategy for an agent i is to place calls, at the times of a Poisson 
(rate 9) process, to a random neighboring agent. 

We will derive formula ([6]). As remarked later, the function g{u) is ultimately derived from 
fine structure of first passage percolation in the plane, and seems impossible to determine as an 
explicit formula. But of course the main point is that (in contrast to the complete graph case) 

QNas 



the Nash equilibrium payoff R — 6^'''^ = R — 0{N ^) tends to the social optimum R. 



3.1 Nearest-neighbor first passage percolation on the torus 

Consider (nearest-neighbor) first passage percolation on the N x N torus, started at a uniform 
random vertex, with rates Uij = 1 for edges {i,j). Write {Tf^ , 1 < i < 4) for the information 
receipt times of the 4 neighbors of the origin (using paths not through the origin), and write 
Q'^(t) for the number of vertices informed by time t. Write T^ = min(T^^, 1 < i < 4). 

The key point is that we expect a N ^ oo limit of the following form 

(i;^-ri^,i<^<4; iv-2Q^(ri^); (iv-^Q^(ri^ + 1) - Q^(ri^)),o < i < oo)) 

^ (tj, 1 < i < 4; U; {Vt, < t < oo)) (21) 

where Tj, 1 < i < 4 are nonnegative with miujTj = 0; U has uniform(0, 1) distribution; <V < 
oo; with a certain complicated joint distribution for these limit quantities. 

To explain (I2ip , first note that as iV — )> cxd the differences T^ — T^ are stochastically bounded 
(by the time to percolate through a finite set of edges) but cannot converge to (by linearity of 

10 



growth rate in the shape theorem below), so we expect some non-degenerate hmit distribution 
{Ti,l < i < 4). Next consider the time Tq^ at which the origin is wetted. By uniformity of 
starting position, Q^ (T^) must have uniform distribution on {1, 2, . . . , A^^}, and it follows that 

N-'^Q^{T^) A U. The final assertion 



(iV-^(Q^^(rf +t)-Q^^(rf)),0<t<oo) A {Vt,0<t<oc) (22) 

is related to the shape theorem [llj for first-pasage percolation on the infinite lattice started at 
the origin. This says that the random set Bs of vertices wetted before time s grows linearly with 
s, and the spatially rescaled set s~^Bs converges to a limit deterministic convex set B: 

s-^Bs -^ B. (23) 

It follows that 

N-^Q^{sN) -^ q{s) as N ^oo 

where q{s) is the area of sB regarded as a subset of the continuous torus [0,1]^. Because 
N-^Q^{T^) A [/ we have 

where q~^{-) is the inverse function of g(-). Writing Q {■) for a suitably-interpreted local growth 
rate oi Q^ {•) we deduce 

(iV-2giV(ri^)^^-lg/iV(ri^)) 4 (U,q'{q~\U))) 
and so <^ holds for V = q'{q"^{U)). 

3.2 Analysis of the rank-based reward game 

We want to study the case where other agents call some neighbor at rate 6 but ego (at the origin) 
calls some neighbor at rate (p. To analyze rewards, by scaling time we can reduce to the case 
where other agents call each neighbor at rate 1 and ego calls each neighbor at rate A = (j)/0. 
We want to compare the rank M^ of ego (rank = j if ego is the j'th person to receive the 
information) with the rank M^ of ego in the A = 1 case. As noted above, M^ is uniform on 
{1,2, . . . , A^^}. Writing {£^f, 1 < « < 4) for independent Exponential(A) r.v.'s, the time at which 
the origin receives the information is 

Ti^ + min(i;^-rf + e^) 

and the rank of the origin is 

Mf = Q^(ri^) + iVg^(min(T;^ - TJ^ + if)) 

where 

Q^(i) = Ar-i(Q^(Tf + t) - Q^(rf )). 

Note we can construct (^,^, 1 < i < 4) as (\~^^-, 1 < ? < 4). Now use (j22p to see that as A^ — )> cxd 

(Ar-2Mf,Af-i(Mf -Mf)) 4 {\J,VZ{\)) (24) 

where 

Z{\) ■= min(r, + ti) - min(Ti + i\). (25) 

11 



Now in the setting where ego calls at rate (j) and others at rate 6 we have 



payoff ((/), 9) - payoff (0, 9) + {(f) - 6) = E 




R 



Ar2 



and it is straightforward to use (j24p to show this 



"1 
iV~i / (-r(n)) Zu{(b/9)du, for z„(A) := E{VZ{X)\U = u). (26) 



'0 

The Nash equilibrium condition 



-— payoff (</>, 61) 







now implies 

9T''-N-' f\-r{u)) z'^{l)du. (27) 

Jo 

Because Z{\) is decreasing in A we have z'^{l) < and this expression is of the form ([6]) with 

g{u) = -<(1) = -^ E{VZiX)\U = u)\,^, (28) 

Remark. The distribution of V depends on the function q{-) which depends on the limit shape 
in nearest neighbor first passage percolation, which is not explicitly known. Also Z{X) involves 
the joint distribution of (tj), which is not explicitly known, and also is (presumably) correlated 
with the direction from the percolation source which is in turn not independent of V. This 
suggests it would be difficult to find an explicit formula for g{u). 



4 Order of magnitude arguments 

Here we mention simple order of magnitude arguments for the two basic cases we have already 
analyzed. As mentioned in the introduction, what matters is the size of the window width we^n of 
the associated first passage percolation process We will re-use such arguments in sections [5] and 
16. H in more complicated settings. 



Complete graph. If agents call at rate 9 = 1 then by (llip the window width is order 1; so 
if 9n is the Nash equilibrium rate then the window width Wn is order l/6'„. Suppose Wn -^ oo. 
Then ego could call at some fixed slow rate cf) and (because this implies many calls are made near 
the start of the window) the reward to ego will tend to R{0), and ego's payoff -R(O) — (j) will be 
larger than the typical payoff R — On- This contradicts the definition of Nash equilibrium. So in 
fact we must have Wn bounded above, implying 9n bounded below, implying the Nash equilbrium 
in wasteful. 



Nearest neighbor torus. If agents call at rate 9 = 1 then by the shape theorem (j23p the 
window width is order N . The time difference between receipt time for different neighbors of ego 
is order 1, so if ego calls at rate 2 instead of rate 1 his rank (and hence his reward) increases 
by order 1/A^. By scaling, if the Nash equilibrium rate is 9n and ego calls at rate 29m then his 
increased reward is again of order 1/A^. His increased cost is 9]^. At the Nash equilibrium the 
increased reward and cost must balance, so ^at is order 1/A^, so the Nash equilibrium is efficient. 
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5 The N X N torus with general interactions: a simple criterion 
for efficiency 

Network communication model. The agents are at the vertices of the N x N torus. Each 
agent i may, at any time, call any other agent j, at cost c{d{i,j)), and learn all items that j 
knows. 

Here d{i,j) is the distance between i and j, and we assume the cost function c{d) satisfies 

c(l) = 1; c{d) t oo as d ^ oo. (29) 

Poisson strategy. An agent's strategy is described by a sequence {0(d); d = 1, 2, 3, . . .); and for 
each d: 

at rate 6(d) the agent calls a random agent at distance d. 

A simple argument below shows 

under condition (|29|) the Nash equilibrium is efficient. (30) 

Consider the Nash strategy, and suppose first that the window width wn converges to a limit 
Woo < oo. Consider a distance d such that the Nash strategy has ^^''^''(d) > 0. Suppose ego uses 
6(d) = 6^'^'''^(d) + (p. The increased cost is (j)c(d) while the increased benefit is at most O(woo4')^ 
because this is the increased chance of getting information earlier. So the Nash strategy must 
have ^"^"""''((i) = for sufficiently large d, not depending on N . But for first passage percolation 
with bounded range transitions, the shape theorem (|23p remains true and implies that wn scales 
as N. 

This contradiction implies that the window width w^ — >• oo. Now suppose the Nash equilib- 
rium were inefficient, with some Nash cost ^ > 0. Suppose ego adopts the strategy of just calling 
a random neighbor at rate (^at, where (/>Ar — )■ 0, 4>nwn -^ oo. Then ego obtains asymptotically 
the same reward R as his neighbor, a typical agent. But ego's cost is (pN — ?> 0. This is a con- 
tradiction with the assumption of inefficiency. So the conclusion is that the Nash equilibrium is 
efficient and wn — > oo. 



Remarks. Result (I30p is striking, but does not tell us what the Nash equilibrium strategy and 
cost actually are. It is a natural open problem to study the case of ([29]) with c(d) = d". Instead 
we study a simpler model in the next section. 



6 The N X N torus with short and long range interactions 

Network communication model. The agents are at the vertices of the N x N torus. Each 
agent i may, at any time, call any of the 4 neighboring agents j (at cost 1), or call any other 
agent j at cost cn > 1, and learn all items that j knows. 

Poisson strategy. An agent's strategy is described by a pair of numbers (^noan ^far) = 6: 
at rate 6^^^^ the agent calls a random neighbor 
at rate 6{^^ the agent calls a random non-neighbor. 

This model obviously interpolates between the complete graph model (cn = 1) and the 
nearest-neighbor model (cn = oo). 
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First let us consider for which values of cn the nearest-neighbor Nash equilibrium (^^car is 
order iV~^, ^f^r = 0) persists in the current setting. When ego considers using a non-zero 
value of ^far, the cost is order c^Of^^. The time for information to reach a typical vertex is order 
-^/6'near = -^^ ; sud SO the beucfit of using a non-zero value of 6{^^ is order Ot^^N"^. We deduce that 

if Cat ^ A^^ then the Nash equilibrium is asymptotically the same as in the nearest-neighbor 
case; in particular, the Nash equilibrum is efficient. 



Let us study the more interesting case 



K.CN <. iV^. 



The result in this case turns out to be, qualitatively 

1/2 ' 

3^ and 6'f'^^* is order c^" 



^noTr*" is order c^ and 6'f'^^* is order Cj^ . In particular, the Nash equilibrum is efficient. (31) 



— 1 /2 

"Efficient" because the cost c^Oi^^ + ^noar is order c^ . See ()32|) for the exact result. 

We first do the order-of-magnitude calculation (section 16. ip , then analyze the relevant first 
passage percolation process (section [6. 2p . and finally do the exact analysis in section [6?3l 

6.1 Order of magnitude calculation 

Our order of magnitude argument for ()3ip uses three ingredients (|32l33l34p . As in section [J] we 
consider the window width wjy of the associated percolation process. Suppose ego deviates from 
the Nash equilibrium (^nea* '^flr*) ^Y setting his 9{^^ = 0^^^^*" + S. The chance of thereby learning 
the information earlier, and hence the increased reward to ego, is order 6wn and the increased 
cost is (5c7v- At the Nash equilibrium these must balance, so 

WN >i CN (32) 

where x denotes "same order of magnitude" . Now consider the difference i^ between the times 
that different neighbors of ego are wetted. Then ij\[ is order l/^^el*- Write 6 = 6^^^^ and suppose 
ego deviates from the Nash equilibrium by setting his 9^^^,. = 26. The increased benefit to ego 
is order (.n /wn and the increased cost is 5. At the Nash equilibrium these must balance, so 
5 X In /wn which becomes 

Finally we need to calculate how the window width wn for FPP depends on (^noar^^far), and we 
show in the next section that 

v^N X e:^ei^/\ (34) 



Granted this, we substitute (I32|33p to get 

^1/3^-1/3 



CAT 

which identifies 6f^^ x c^" as stated at ([3T]l . 

6.2 First passage percolation on the N x N torus with short and long range 
interactions 

We study the model (call it short-long FPP, to distinguish it from nearest-neighbor FPP) defined 
by rates 

Uij = \, 3 Si. neighbor of i 

t2 



Xn/N , j not a neighbor of i 
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where 1 » Atv > N~^. 

Recall the shape theorem (j23p for nearest neighbor first passage percolation; let A be the area 
of the limit shape B. Define an artificial distance p such that B is the unit ball in p-distance; so 
nearest neighbor first passage percolation moves at asymptotic speed 1 with respect to p-distance. 
Consider short-long FPP started at a random vertex of the N x N torus. Write -^at.Ajv ^i' the 
proportion of vertices reached by time t and let Trofl) t>e the time at which the origin is reached. 
The event {TtQ^y < t} corresponds asymptotically to the event that at some time t — u there is 
percolation across some long edge (i,j) into some vertex j at /9-distance < u from (0,0) (here 
we use the fact that nearest neighbor first passage percolation moves at asymptotic speed 1 with 
respect to p-distance) . The rate of such events at time t — u is approximately 

where the three terms represent the number of possible vertices i, the number of possible vertices 
j, and the percolation rate z/jj. Since these events occur asymptotically as a Poisson process in 
time, we get 

1 - Fn,Xj, {t) « P(r(o,o) < i) ~ exp (-AXn f u^F^^Xj, {t - u) duj . (35) 

This motivates study of the equation (for an unknown distribution function Fx) 

l-Fx{t)=ex.p(-X (t-sfFxis) ds] , -oo < i < cx) (36) 

whose solution should be unique up to centering. Writing Fi for the A = 1 solution, the general 
solution scales as 

Fx{t):=F,{Xyh). 



So by (j35]) . up to centering 

F^,A^(t)«Fi((^A^)i/3i). (37) 

To translate this result into the context of the rank-based rewards game, suppose each agent 
uses strategy On = (0Ar,ncar;^Af,far)- Then the spread of one item of information is as first passage 
percolation with rates 



?Ar,noar/4, j & uelghbor of i 



/{N — 5), j not a neighbor of i. 



This is essentially the case above with \n = ^Ar,fai/^Af,noar5 time-scaled by ^Ar^noar, and so by ([37]) 
the distribution function F^fij^ for the time at which a typical agent receives the information is 

FN,eAt)^F,[A^lH]ile]llj). (38) 

In particular the window width is as stated at (|34p . 



6.3 Exact equations for the Nash equihbrium 

The equations will involve three quantities: 

(i) The solution Fi of ([MD- 

(ii) The area A of the limit set B in the shape theorem (I23p for nearest-neighbor first passage 
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pecolation. 

(iii) The limit distribution (cf. ([21 



{T[ - r;, 1 < i < 4) 4 (Ti, 1 < i < 4) as r ^ oo (39) 

for relative receipt times of neighbors of the origin in nearest-neighbor first passage pecolation, 
where now we start the percolation at a random vertex of p-distance ~ r from the origin. 

To start the analysis, suppose all agents use rates = (^Ar.near, ^A^.tar)- Consider the quantities 

S is the first time that ego receives the information from a non-neighbor 

T is the first time that ego receives the information from a neighbor 

F = Fjyffj^ is the distribution function of T. 

With probability — )• 1 as A^ — )■ oo ego will actually receive the information first from a neigh- 
bor, and so F is asymptotically the distribution function of the time at which ego receives the 
information. 

Now suppose ego uses a different rate cp^^f^^ / ^Ar.far for calling a non-neighbor. This does 
not affect T but changes the distribution of S to 

Pis > t) « exp f-07v,fa. I F{s) ds) 
by the natural Poisson process approximation. Because 6n,{3^i is small we can approximate 

P{S <t)^ (t>N,f.r I F{s) ds. 

The mean reward to ego for one item, as a function of (pN.iiir, varies as 

E{R{F{S)) - R{F{T))l^s<T) + constant. 
Because U = F{T) is uniform on (0, 1), in the N —^ oo limit 

EiRiFiS)) - R{F{T))l^s<T) = EiR{F{S)) - RiU))l^Fis)<u) 

du E{R{FiS)) - R{u))l(^p^s)<u) 



1 PU 

du E r{y)dy 

Jimn{F(S),u 

dy [1 - y)r{y)P{F{S) < y) 



dy [1 - y)r{y)P{S < F-\y)) 



= 4>N,ur / dy {I- y)r{y) l F{s)ds. 

Jo J-oo 

The cost associated with using (f)N,t8ir is CN4>N,t!,.r, and at the Nash equilibrium the cost and reward 
must balance, so at the Nash equilibrium F = -F/v,6»jv must satisfy 

/■I fF'Hy) 

CN^ dy{l- y)r{y) / F{s)ds. (40) 

JO J ~oo 
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Now suppose instead that ego uses a different rate (j)N,no!!,r 7^ ^Af,ncar for calling a neighbor. As 
in section [321 we set A = 4>N,neai/0N,nc3.r so that we can use rate-1 nearest-neighbor first passage 
pecolation as comparsion. For (tj) at (|39|) and independent Exponential(A) random variables (^^) 
write (as at ([25]) ) 

Z(A) := min(ri + Ci) - min(ri + C}). 

So Z{X) is the time difference for ego receiving the information, caused by ego using (/)Ar,near 
instead of ^Ar,ncar- This time difference is measured after time-rescaling; in real time units the 
time difference is Z{\)/6n^^^^^. 

As above, write T for receipt time for ego using ^Ar^ncar, and F = F^^Qj^ for its distribution 
function. Then receipt time for ego using 4>N,nc3.r is T + Z{X)/6iy^nc!,.r, so ego's rank becomes f« 
F{T)+F'{T)Z{X)/0N,nc.r, and setting U = F{T) the rank of ego is « U+F'{F^^{U))Z{X)/dN,nc.r. 
The associated mean reward change for ego is asymptotically 



PAT.ncai 



X [ r{u)F'{F~\u))du; A = </.jv,nca7^- 



where 2:(A) = EZ{X). Because the cost of using rate 4>N,nBa.T equals (j)N,nca.i, the Nash equilibrium 
condition ([3]) implies 

0%,.... ~ z'{l) f r{u)F'{F-\u)) du. (41) 

Jo 

We have now obtained the desired two equations for Fj\f^g^ at the Nash equilibrium 0i\f. Use 
([38]) to rewrite these equations (|40l4ip in terms of Fi as 

CM ~ A-y-%fr'il dy{l-y)r{y) ' F,{s)ds 



A"'0]lle]ilj{l) / r{u)F[{F^\u)) du. 
Jo 

Solving for ^Ar^near, ^Af,far WC fiud 

^7V,„ea. ~ g'/'4^', ^7V,ta. ~ ^"'Q^'c]^' (42) 

for 

Q = z'{l) U'dy {l-y)r{y) j " ' Fi{s)ds\ (j\{u)F[{F-\u)) dv\ . 

7 Variants 

7.1 Transitivity and the symmetric variant 

The examples we have studied so far have a certain property called transitivity in graph theory 
[3] . Informally, transitivity means "the network looks the same to each agent" ; formally, it means 
that for any two agents i,j there is an automorphism of the network that preserves the network 
cost structure and maps i to j. This is what allows us to assume that in a Nash equilibrium each 
agent uses same strategy. 

The general framework of section 11.11 uses the asymmetric model in which agent i calls agent 
j (at a certain cost to i) and learns all items that j knows. In the symmetric variant, agent i 
calls agent j (at a certain cost to i), and each tells the other all items they know. 
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For the transitive networks we have studied there is a simple relationship between the Nash 
equilibrium values of the asymmetric and symmetric variants of the Poisson strategies: 

^sym = 2^asy • (43 j 

The point is that the percolation process in the symmetric variant is just the percolation process 
in the asymmetric variant, run at twice the speed, and this leads to the following relationship 
between the reward when ego uses rate <j) and other agents use rate 9: 

rewardsyn,(0, 9) = rewardasy((/> + 6, 29). 

Because payoff (0, 9) = reward((/), ^) — <^ in each case, we get 

payoff 3y„,(0, 9) = ps.yoS^j4> + 0, 29) + 9 

and therefore 

^payoff ,,„,(</>, 9) = Apayoff^^^(,/, + 9, 29). 

The criterion (El) leads to (H3D. 



7.2 Communication at regular intervals 

We have studied "Poisson rate 0" calling strategies because these are simplest to analyze explicitly. 
A natural alternative is the "regular, rate 0" strategy in which agent i calls a random other agent 
at times 

Ui,Ui + \,U, + l,... (44) 

where Ui is uniform on (0, g). 

Consider first the complete graph case, and the setting (section 12. ip where (for fixed k > 2) 
the first k recipients get reward n/k. In this case, for k = 2 formula ([S]) is replaced by 

P(ego is second to receive item) = / (1 — 9u)^ (j) "^^ 

Jo 

and repeating the analysis in section 12.11 gives exactly the same asymptotics (|9|10|) as in the 
Poisson case. Consider instead the general reward framework 

The j'th person to learn an item of information gets reward R{-). 

If all agents use rate 9 then the distribution function Fg for receipt time for a typical agent 
satisfies (as an analog of the logistic equation (|15p ) 

1 - Fg{t) = f ^ ll{l-Fe{s- I)) 9 ds. (45) 

■^*~i i>0 

If ego switches to rate (p then the distribution function G^^g for ego's receipt time satisfies (as 
an analog of (fT6]l ) 



i-G^,e(t)=£^ n(i-^^(*-i)) '^ds. 



(46) 



*~0 i>0 



One can now continue the section 12.41 analysis; we do not get useful explicit solutions but the 
qualitative behavior is similar to the "Poisson calls" case, and in particular the Nash equilibrium 
is wasteful. 

Similarly, on the N x N grid with nearest neighbor interaction, switching from the "Poisson 
calls" case to the "regular calls" case preserves the order A'^"^ value of the Nash equilibrium rate 
9^^^^ and hence preserves its efficiency. 
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7.3 Gossip with reward based on audience size 

Perhaps a more realistic model for gossip is to replace Rule 2 by 

Rule 3. An agent i gets reward c whenever another agent learns an item from i. 

For the complete graph and Poisson(0) strategies we can re-use the section [TM analysis to calculate 
the Nash equilibrium. First suppose all agents use the same rate 9 and consider an agent i who 
receives the information at percentile u. For j > un the j'th agent to receive the information 
has chance ^ to receive it from agent i, and so the mean reward to agent i is (calculations in the 

n — )■ oo limit) c J^ ^ dx = — clog(l — u). Suppose now ego switches to rate (j). Then (calls incur 
unit cost) 

payoff(,^, 9) = -^ + cE{- log(l - Fe{T^,e))) 

where the time T^^o at which ego receives the information has distribution function G^^e at (J17p . 
and where Fg at (J14p is the distribution function of the time at which a typical agent receives the 
information. Now 

£;(-log(l - Fe{T^,e))) = I j^P{Fe{T^,e) < u) du 

Jo 

= [ ^(l-(l-«))<^/^dnby((l8D 
Jo 

= [\-\i-yy 

Jo 

and then we calculate 



dy 



Apayoff(0,0) = -1 - c^' MizE) (1^ dy. 

Now the Nash equilibrium criterion ([3]) implies 

91-^ ^-cj i^ log(l-y)rfy. (47) 

So switching to this "Rule 3" model preserves the wastefulness of the Nash equilibrium on the 
compete graph. 

However, for the N x N grid with nearest neighbor interaction, switching to the "Rule 3" 
models changes the efficient (^J^f'* is order A^~^) Nash equilibrium to a wasteful equilibrium with 
^Nash becoming order 1. 

7.4 Related literature 

We do not know any literature closely related to our model. As well as the epidemic and the 
gossip algorithm topics mentioned in the introduction, and classic applied probability work on 
stochastic rumors [5], other loosely related work includes 

• models where agents form networks under conditions where there are costs for maintaining 
network edges and benefits from being part of a large network [7j. 

• Prisoners' Dilemma games between neighboring agents on a graph [6]. 

One can add many other topics which are harder to model mathematically, e.g. diffusion of 
technological innovations [12] or of ideologies. 

19 



References 

[1] D.J. Aldous and S. Bhamidi. Edge flows in the compete random-lengths network. 
[a?Xiv:0708.0555 , 

[2] H. Andersson and T. Britton. Stochastic epidemic models and their statistical analysis, 
volume 151 of Lecture Notes in Statistics. Springer- Verlag, New York, 2000. 

[3] N. L. Biggs. Algebraic Graph Theory. Cambridge University Press, 1974. 

[4] R. Durrett and S. Chatterjee. Asymptotic Behavior of Aldous' Gossip Process. 
larXiv:1005.1608l 

[5] D. J. Daley and D. G. Kendall. Stochastic rumours. IMA J. Appl. Math., 1:42-55, 1965. 

[6] C. Hauert and G. Szabo. Game theory and physics. American Journal of Physics, 73(5):405- 
414, 2005. 

[7] M.O. Jackson. A Survey of Models of Network Formation: Stability and Efficiency. In 
G. Demange and M.H. Wooders, editors. Group Formation in Economics: Networks, Clubs, 
and Coalitions. Cambridge University Press, 2005. 

[8] D. Kempe, J. Kleinberg, and A. Demers. Spatial gossip and resource location protocols. J. 
ACM, 51(6):943-967, 2004. 

[9] D. G. Kendall. La propagation d'une epidemie ou d'un bruit dans une population limitee. 
Publ. Inst. Statist. Univ. Paris, 6:307-311, 1957. 

[10] J.O. Kephart and S.R. White. Directed-graph epidemiological models of computer viruses. 
In Proc. 1991 IEEE Computer Society Symposium on Research in Security and Privacy, 
pages 343-359, 1991. 

[11] H. Kesten. First-passage percolation. In From classical to modern probability, volume 54 of 
Progr. Probab., pages 93-143. Birkhauser, Basel, 2003. 

[12] E. M. Rogers. Diffusion of Innovations. Free Press, 5th edition, 2003. 

[13] R. van der Hofstad, G. Hooghiemstra, and P. Van Mieghem. The flooding time in random 
graphs. Extremes, 5(2):111-129 (2003), 2002. 



20 



